SEQUENCE LISTING 

SEQ ID NOs: 1 and 2

LOCUS COTMYBA 1006 bp mRNA PLN 31-DEC-1993

DEFINITION Cotton DNA-binding domain mRNA. 
ACCESSION L04497
NID g437326
VERSION L04497.1 GI:437326
KEYWORDS . 

SOURCE Gossypium hirsutum (cultivar Acala SJ-2) 3-day pre-anthesis ovule 
cDNA to mRNA.

ORGANISM Gossypium hirsutum 

Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; 
euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core 
eudicots; Rosidae; eurosids II; Malvales; Malvaceae; Gossypium. 
REFERENCE 1 (bases 1 to 1006)

AUTHORS Wilkins,T.A. and Lu,C.-C. 
JOURNAL Unpublished (1993) 
FEATURES Location/Qualifiers 
source 1..1006 
/organism="Gossypium hirsutum"
/cultivar="Acala SJ-2"
/db_xref="taxon:3635"
/dev_stage="3-day pre-anthesis" 
/tissue_type="ovule" 
mRNA 1..1006
CDS 59..943

/note="MYB A; putative" 
/codon_start=1
/protein_id="AAA33067.1"
/db_xref="PID:g437327"
/db_xref="GI:437327"

protein_bind 92..403
/note="putative"
/bound_moiety="MYB"
repeat_region 107..133
/note="MYB DNA-binding domain repeat signature 1; putative"
misc_feature 323..394
BASE COUNT 323 a 209 c 204 g 270 t
ORIGIN 

1 taacaccgtt attctttctc tattctacct gatttgattt gatttgattt tgtaactgat 
61 gggacgatca ccttgttgtg aaaaggctca taccaacaaa ggtgcctgga ccaaagagga 
121 agatcaacgc ctcatcaact acatccgtgt ccatggtgaa ggctgctggc gttccctccc 
181 caaagctgct gggctgctta gatgtggtaa gagttgcaga ttaagatgga taaactactt
241 gaggcctgat cttaagagag gaaatttcac tgaagaagaa gatgagctta tcatcaagct 
301 tcacagttta cttggaaaca aatgg



361 taatgagata aagaactact ggaacacaca catcaaaaga aagcttataa gcagaggaat 
421 tgatccacaa actcatcgtc ctctcaatca aacggccaat accaacacag tcacagcccc 






481 caccgaattg gatttcagaa actcgcccac atccgtttcc aaatccagtt ccatcaaaaa 
541 cccgtctctg gatttcaatt acaatgaatt tcaattcaag tccaacacag attcccttga 
601 agaacccaac tgtacagcca gcagtggcat gactacagat gaagagcaac aagaacagct 
661 gcacaagaag cagcaatacg gtccgagcaa tgggcaagac ataaatttgg agctgtcgat 
721 tgggattgtt tcagctgact catctcgggt atcaaatgcc aactcggccg agtcgaaacc

781 aaaggtagat aacaacaatt tccagtttct tgaacaagct atggtggcta aggcggtatg 
841 tttgtgttgg caattaggtt ttggaacaag tgaaatttgt aggaactgtc aaaattcaaa 
901 ttcaaatggc ttctatagtt attgtagacc cttggattca tagggtcatc tttttcttct 
961 ttctttctgt ttttaggaga taaattaatg cttaattatt aaaaaa 




MGRSPCCEKAHTNKGAWTKEEDQR
WINYLRPDLKRGNFTEEEDELII 

KLISRGIDPQTHRPLNQTANTNTVTAPTELDFRNSPTSVSKSSSIKNPSLDFNYNEFQF
KSNTDSLEEPNCTASSGMTTDEEQQEQLHKKQQYGPSNGQDINLELSIGIVSADSSRV
SNANSAESKPKVDNNNFQFLEQAMVAKAVCLCWQLGFGTSEICRNCQNSNSNGFYS 
YCRPLDS 

SEQ ID NOs: 3 and 4

LOCUS AF034134 1151 bp mRNA PLN 02-MAR-1998 

DEFINITION Gossypium hirsutum MYB-like DNA-binding domain protein (Cmy-O) 

mRNA, complete cds. 
ACCESSION AF034134 
NID g2921339

VERSION AF034134.1 GI:2921339 
KEYWORDS . 
SOURCE upland cotton. 
ORGANISM Gossypium hirsutum 
Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;

euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core 
eudicots; Rosidae; eurosids II; Malvales; Malvaceae; Gossypium. 
REFERENCE 1 (bases 1 to 1151)
AUTHORS Loguercio,L.L., Zhang,J. and Wilkins,T.A.
TITLE Structure and expression of six classes of myb-domain genes in
allotetraploid cotton (Gossypium hirsutum L.) 
JOURNAL Unpublished 
REFERENCE 2 (bases 1 to 1151) 
AUTHORS Loguercio,L.L., Zhang,J. and Wilkins,T.A.
TITLE Direct Submission

JOURNAL Submitted (13-NOV-1997) Agronomy & Range Science, University of 
California, One Shields Ave., Davis, CA 95616-8515, USA 
FEATURES Location/Qualifiers 
source 1..1151 
/organism="Gossypium hirsutum"
/cultivar="Acala SJ-2"
/db_xref="taxon:3635"
/dev_stage="3 days pre-anthesis" 
/tissue_type="ovule" 
gene 1..1151
/gene="Cmy-0"
/note="MYB-domain gene O"
CDS 72..752
/gene="Cmy-0"
/function="putative MYB-like transcription factor"
/note="similar to MYB A encoded by GenBank Accession Number L04497"
/codon_start=1
/product="MYB-like DNA-binding domain protein"
/protein_id="AAC04720.1"
/db_xref="PID:g2921340"
/db_xref="GI:2921340"



BASE COUNT 382 a 215 c 240 g 314 t
ORIGIN 

1 cggattttct ttccccgtgt ttggttgcac agaaagtgag agaaagtttt acttttgatt 
61 ttgaaactcc gatgagaaaa ccttgctgcg ataaacaagg caccaacaag ggagcctggt 
121 ccaagcaaga agatcaaaag ctcattgatt atatacgtat tcatggtgaa ggctgttggc 
181 gttccctccc caaagctgca ggtttgcacc gttgcggtaa aagttgcagg ctgagatgga

241 taaattactt aagaccagat atcaaacgtg gtaactttgc tcaagacgaa gaggacttaa 
301 ttatcaaact ccatgctctc cttggtaacc ggtggtcact gatagctggt agattaccag 
361 gaagaacaga taatgaagtg aagaactatt ggaattccca tataaagaga aagctaatga 



MRKPCCDKQGTNKGAWSKQEDQKLIDYIRIHGEGCWRSLPKAAGLHRCGKSCRLR 
WINYLRPDIKRGNFAQDEEDLIIKLHALLGNRWSLIAGRL
RKLMKMGIDPNNHKLNQYPHHV

YLEDATPPTGISNLDLDLTIAFPSSPIKNIIEESQQKTASIVTNDEEEQYTVPTLLLFR 

repeat_region 102..257
/note="putative MYB DNA-binding domain repeat R2"
repeat_region 258..412
/note="putative MYB DNA-binding domain repeat R3"






421 agatggggat cgatcctaat aaccataagt tgaaccaata tcctcatcat gttggtcccc 
481 ttaaccccac caccaccaac tccatggatg tggcatgtaa gcttagagtg tgttcaacag 
541 acaatgatga tgggatctca gatgctgcaa gttatctcga agacgcaaca ccgcccactg 
601 gtatatccaa cttggacctt gatctcacaa ttgcttttcc ttcgagtcct atcaagaata 
661 ttattgaaga aagccagcag aaaacagcat ctattgtaac aaatgatgaa gaagaacaat 
721 atacagtccc tacccttctt cttttcagat gagacaaaaa aaaaagcctc acacatgtgg 
781 agattcgtgc aaaagaccta aaggcttacg aaggcaacat gcacgccatt gtcaaattct 
841 tttggatgat ggattgaaac catatccttg tccattagaa aggaggaaga taagctaaaa 
901 ctgtattatt gtgtataaat ttggtagaaa gaaagatttc aacttaagaa ttaggatcaa 
961 ataactgaat gaatgaacga attgcagata agttgttagg aggttttcaa tcaacttatc 
1021 tgcaattaat ttggtggagc tgatgtagga tgatgagttc atcgtacatg aactgaacct 
1081 ttgatatttc aggctctaat tgtctgtttg tatgcgtaaa gatattcttc aatgtgagat 
1141 cagctaaaaa a 
SEQ ID NO: 5

Cotton (Gossypium hirsutum L. cv. Acala SJ-2) 10 dpa fiber cDNA 
R2R3-MYB transcription factor 
GhMYB7



ID 6W; DNA; 1081 BP.
SQ SEQUENCE 1081 BP; 353 A; 228 C; 240 G; 260 T;

CTCCCCGCGG TGGCGGCCGC TCTAGAACTA GTGGATCCCC CGGGCTGCAG 
GAATTCGGCA 

CGAGGAAAGA AGTGTGAAAA AAAAAATGGG AAGGAGTCCT TGTTGTTCTA 
AGGAAGGCCT 

TAACAGAGGA GCTTGGACTG CTCTTGAAGA CAAAATTCTT AAAGATTATA
TCAAAGTACA 

CGGTGAAGGT CGTTGGAGAA ATCTCCCCAA AAGAGCTGGT CTTAAGAGAT 
GTGGGAAAAG 

TTGTAGGCTT CGGTGGTTGA ATTATTTGAG ACCTGATATT AAAAGAGGTA 
ACATATCACC

TGACGAGGAA GAGCTTATCA TCAAACTCCA CAAACTCTTG GGAAACAGAT 
GGTCTTTGAT 

AGCTGGGAGG CTTCCAGGAC GAACAGACAA TGAAATAAAG AATTACTGGA 
ACACCAACTT 

AAGTAAAAGA GTTTCCGATC GTCAAAAGTC ACCCGCCGCT CCTTCGAAAA
ATCCCGAGGC 

GGCTCGACGA GGAACTGCTG GTAATGGCAA TACCAATGGT AATGGTAGTG 
GTAGTTCCTC 

GACACACGTG GTGCGGACAA GGGCGACAAG GTGCTCCAAG GTTTTCATAA 
ACCCTCCTCA

CTACACACAA AACAGAGACC CAAAGCCTTC TTCAACTTGT TCAAATCATG 
GGGATCACCG 

GGAACCTAAA ACAATGAATG AGTTGTTATT ACCGATAATG TCAGAATCCG 
AGAATGAAGG 

GACGACCGAT CATATATCAT CGGATTTTAC ATTTGACTTC AACATGGGAG
AGTTTTGTTT

ATCGGATCTT TTGAATTCCG ATTTCTGCGA TGTAAACGAG CTTAATTACA 
GCAATGGTTT 

TGATTCGTCA CCCTCACCGG ATCAGCCTCC TATGGATTTC TCCGACGAAA 
TGCTAAAAGA

GTGGACGGCC GCCGCCTCCA CTCACTGCTG TCACCAAAGT GCGGCTTCCA 
ATCTCCAGTC 

CTTGCCTCCA TTTATTGAAA ATGGAATTGA ATGACCTTGA AAAAATAAAA 
GACGAAAAAT 

ATTTTCTCAT GTAAACTAAA TAAACACATC TTCCATCATT AAAAAAAAAA
AAAAAAAAAA 
A 

//
SEQ ID NO:6

ID 6WP; PRT; 302 AA.

DT 09-NOV-1999 (CREATED BY PC/GENE PROGRAM TRANSL)
DE

CC TRANSLATED FROM DNA SEQUENCE 6W (BASES 86 TO 993). 
SQ SEQUENCE 302 AA; 33766 MW; 451675 CN; 

MGRSPCCSKE GLNRGAWTAL EDKILKDYIK VHGEGRWRNL PKRAGLKRCG 
KSCRLRWLNY 

LRPDIKRGNI SPDEEELIIK LHKLLGNRWS LIAGRLPGRT DNEIKNYWNT
NLSKRVSDRQ 

KSPAAPSKNP EAARRGTAGN GNTNGNGSGS SSTHVVRTRA TRCSKVFINP 
PHYTQNRDPK 

PSSTCSNHGD HREPKTMNEL LLPIMSESEN EGTTDHISSD FTFDFNMGEF 
CLSDLLNSDF

CDVNELNYSN GFDSSPSPDQ PPMDFSDEML KEWTAAASTH CCHQSAASNL 
QSLPPFIENG 
IE 
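
The CC line of this record states that the protein was translated from DNA sequence 6W beginning at base 86. As a rough illustration of that step (not the PC/GENE TRANSL program itself), the Python sketch below translates an open reading frame with the standard genetic code. The function name `translate` and its 1-based `start` parameter are assumptions made for this example only.

```python
# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna, start=1):
    """Translate dna from 1-based position `start` up to the first stop codon.

    Whitespace is ignored so that blocked listing lines can be pasted as-is.
    Codons containing anything other than A, C, G or T are reported as 'X'.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(start - 1, len(seq) - 2, 3):
        residue = CODON_TABLE.get(seq[i:i + 3], "X")
        if residue == "*":  # stop codon ends the reading frame
            break
        protein.append(residue)
    return "".join(protein)


# Bases 86 to 106 of SEQ ID NO: 5, copied from the listing above.
print(translate("ATGGG AAGGAGTCCT TGTTGT"))  # prints MGRSPCC
```

The seven residues printed agree with the start of SEQ ID NO:6. Passing the full 6W sequence with `start=86` would be the corresponding whole-record check.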
Cotton (Gossypium hirsutum L. cv Acala SJ-2) 10 dpa fiber cDNA

R2R3-MYB Transcription Factor 

GhMYB8 

SEQ ID NO: 7

ID 19W; DNA; 944 BP. 

SQ SEQUENCE 944 BP; 319 A; 163 C; 214 G; 246 T; 2 OTHER; 

TGGAGCTCCC CGCGGTGGCN GNCGCTCTAG AACTAGTGGA TCCCCCGGGC 
TGCAGGAATT 

CGGCACGAGA CTCCAACAAA TGTCCATGAA AAAAGAAGGT GAAATTCTAT
ACAAAAAGGG 

ATTATGGGCA ATGGAGGAAG ACAAGTTACT CATTGATTAT GTCAATGTCC 
ATGGAAAAGG 

ACAATGGAAC AAAATAGCCA ACAGAACAGG TTTGAAGAGA AGTGGGAAAA 
GTTGTCGGCT

AAGGTGGATG AATTACCTGA GTCCTAACGT TAAAAAGGGT GATTTTTCTG 
AAGAAGAAGA 

AGACCTCGTC ATTAGACTTC ATAAGCTTCT TGGAAACAGG TGGTCTTTGA 
TTGCGAAACG 

AGTTCCAGGT CGAACTGACA ATCAAGTCAA GAATTACTGG AATAGTCATT
TGAGGAAGAA 

ACTAGGGATC ATTGATCAAA ACAAGACAAG GATCGATTTT TGTCAAAGTT 
CAAAGCAAGT 

CAAAGTGTGT CATGTTGATG AGGCAGCCAC GGATCCAAGT CCTGGACATG 
GAACAACCAC

TGAAACCACG GGTATAACAG TGGATCAGAG TAACCAGCAG GAAGTCATTG 
ATCATCGGGT 

CTTAAACAAT ACTACTCAAG AATCAATGAC CAGTGAGAGT TATATCAACA 
CTTTCTGGAT 

TCCTGACCAT GATTATGAGC TAAGTACACT TGCCATGATT GACCATGATT
ATGAGCTAAG 

TACACTTGCC ATGATTGACC ACTTCCATGA ATGTTCTTCT TTTCATCTTA 
GCTAGAGACT 

ATGTTATTAG ATTCGGGTTT TATTTTTAGA TATAAGTATG CATCTAACAT 
GGCAATGTTA

AATTTTTCAA AAGATTTTTC ATGTATTTGA GCAGTTCATG TGTTTGAAGA 
TTAAGATATT 

TCTGAAAAAA AAAAAAAAAA AAAACCGAGG GGGGCCCGGT ACCC 

// 
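
The SQ header of each record gives a base composition (for 19W: 319 A, 163 C, 214 G, 246 T and 2 OTHER over 944 BP). A minimal sketch of how a transcribed sequence could be checked against such a header is given below. The helper name `base_composition` is hypothetical, and it assumes that every letter other than A, C, G or T (such as the two N positions in 19W) counts as OTHER.

```python
from collections import Counter


def base_composition(dna):
    """Count A, C, G, T and other letters, ignoring spaces, digits and newlines."""
    seq = "".join(ch for ch in dna.upper() if ch.isalpha())
    counts = Counter(seq)
    result = {base: counts.get(base, 0) for base in "ACGT"}
    result["OTHER"] = len(seq) - sum(result.values())
    result["TOTAL"] = len(seq)
    return result


# Small made-up input; paste the full record text to compare with its SQ line.
print(base_composition("ACGT NN acgt 12"))
```

Because digits and whitespace are skipped, GenBank-style lines with leading position numbers can be passed in unchanged.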


SEQ ID NO:8 

ID 19WP; PRT; 231 AA. 

DT 25-OCT-1999 (CREATED BY PC/GENE PROGRAM TRANSL)
DE

CC TRANSLATED FROM DNA SEQUENCE 19W (BASES 80 TO 772). 
SQ SEQUENCE 231 AA; 26698 MW; 283450 CN; 

MSMKKEGEIL

SCRLRWMNYL 
SPNVKKGDFS EEEEDLVIRL HKLLGNRWSL IAKRVPGRTD NQVKNYWNSH
LRKKLGIIDQ

NKTRIDFCQS SKQVKVCHVD EAATDPSPGH GTTTETTGIT VDQSNQQEVI 
DHRVLNNTTQ 

ESMTSESYIN TFWIPDHDYE LSTLAMIDHD YELSTLAMID HFHECSSFHL S 



